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(54) Mutation analysis by PCR and Mass spectrometry 



(57) The invention concerns mass spectrometric 
analysis of known mutation sites in the genome, such 
as single nucleotide polymorphisms (SNPs). 
The invention uses minor amounts of primers with 
photocleavable linkers, intermixed with a major 
amounts of primers without linkers, to produce short mu- 
tation-containing DNA sequences by enzymatic ampli- 
fication procedures such as polymerase chain reactions 



(PCR). Afterthis single amplification procedure, the link- 
er-containing PCR by-products are extracted, washed 
and photolytically cleaved. Short oligonucleotides are 
produced which facilitate mass spectrometric analysis. 
Additionally to the use of linkers, some types of primers 
may contain blockers which stop the polymerase copy- 
ing process to achieve even shorter oligonucleotides for 
analysis. 
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Description 

[0001] The invention concerns mass spectro metric 
analysis of known mutation sites in the genome, such 
as single nucleotide polymorphisms (SNPs). 
[0002] The invention uses minor amounts of primers 
with photocleavable linkers, intermixed with a major 
amounts of primers without linkers, to produce short mu- 
tation-containing DNA sequences by enzymatic ampli- 
fication procedures such as polymerase chain reactions 
(PCR). After this single amplification procedure, the link- 
er-containing PCR by-products are extracted, washed 
and photolytically cleaved. Short oligonucleotides are 
produced which facilitate mass spectro metric analysis. 
Additionally to the use of linkers, some types of primers 
may contain blockers which stop the polymerase copy- 
ing process to achieve even shorter oligonucleotides for 
analysis. 

Prior art 

[0003] Subject of this invention is a diagnostic method 
for the detection of actual mutative states in the genome 
DNA, whereby the possible mutation site has to be 
known beforehand. These mutative sequence changes, 
compared to the standardized sequence of a "wild type", 
may either be a base exchange ("point mutation") or the 
introduction of nucleotides ("insertion") or removal of nu- 
cleotides ("deletion"). Point mutations with a frequency 
above one percent in a population have been named 
"single nucleotide polymorphisms"; the abbreviation 
SNP has become particularly wide-spread in the recent 
literature. For humans, it is supposed that there are 
about 10 million SNPs which characterize most of the 
individually inherited differences between humans. 
They control the individual phenotypes. Roughly three 
million SNPs are estimated to be in the frequency range 
of 30 to 70 percent of the population. End of the year 
2001 , more than one and a quarter million SNPs were 
discovered and listed in the public data base NBCI of 
the worlwide acting SNP Consortium. 
[0004] For the genome of a species, it is customary 
to define a "wild type" which is regarded as free of mu- 
tation, and a "mutant" which contains a mutation. Con- 
sidering the frequency of mutations such as SNPs, and 
the equal value of mutants and wild types, the definition 
of the wild type is arbitrary or at least purely accidental, 
as already reflected in the term "polymorphism". 
[0005] Nearly all DNA mutations, including all those 
defined above, produce differences in the mass of the 
DNA segment containing the mutation in comparison to 
the mass of a corresponding segment of the wild type. 
The precise mass determination of a DNA segment can 
therefore be used for the determination of a mutation. 
Exceptions of this rule are the relatively rare "rotations", 
an interchange of two bases in a sequence. 
[0006] Mass spectrometry is a very powerful and pre- 
cise tool for determining the mass of a bio-molecule. By 



using a mass spectrometric method, such as time-of- 
flight mass spectrometry (TOF-MS) with ionization by 
matrix-assisted laser desorption and ionization (MAL- 
Dl), it is possible to analyze the ions for their masses. 

5 However, ionization can also be achieved using electro- 
spray ionization (ESI), in the latter case with mass spec- 
trometers which are frequently of a different type. 
[0007] With polymerase chain reactions (PCR), using 
a pair of "selection primers", i.e. single strand oligonu- 

10 cleotides about 20 bases long, it is possible to produce 
amounts in the order of billions of double-strand PCR 
products with a length of at least 40 base pairs in a well- 
known way. The production process for these oligonu- 
cleotides increases the number of products exponen- 
ts tially by application of temperature cycles ("thermocy- 
cles"); such processes have become known under the 
general term "amplification". The mutation site can be 
incorporated in the products by adequately choosing the 
sequences of the two selection primers. 

20 [0008] The obvious method to simply measure the 
mass of the PCR-amplified oligonucleotides as such by 
mass spectrometry, was found to be almost unworkable. 
The precise measurement of these DNA products with 
more than 40 base pairs proved itself to be almost im- 

25 possible. The reasons for this are extremely low sensi- 
tivity for long DNA products because of difficult ioniza- 
tion, high probability of adduct formation with undefined 
numbers of sodium or potassium anions, and easy frag- 
mentation of the fragile DNA products. These oligonu- 

30 cleotides have a poly-anionic character; each phos- 
phate group of the DNA backbone forms an anion and 
has to be neutralized during ionization by a proton 
(which eagerly are replaced by alkali cations if present). 
A method therefore had to be found to provide as short 

35 oligonucleotides as possible, still containing the muta- 
tion site. 

[0009] To this end, several methods of restricted, mu- 
tation-dependent primer extension using terminating 
derivatives of the nucleotide tri-phosphates have been 
40 developed in order to generate extended primers of ap- 
proximately 12 to 25 nucleotides in length only, better 
suited to identify the nature of the mutation by mass 
spectrometry. 

[0010] These methods basically consist of thefollow- 
45 jng steps: Firstly, a sufficient number of copies of the 
DNA segment containing the mutation site is produced 
by PCR using a pair of selection primers. After extraction 
and washing, these DNA segments secondly serve as 
templates for the enzymatic, mutation-dependent ex- 
50 tension of an "extension primer" by a second phase of 
thermocycling. In this second thermocycling phase, one 
to four of the nucleotide triphosphates are derivatized in 
such a manner that they serve as terminators for the 
extension, i.e., if the terminator is built in at the 3' end, 
55 a prolongation is no longer possible becausethe binding 
site is occupied. The extension primer may be identical 
with one of the two selection primers; however it is reg- 
ularly much better to use an extension primer which is 
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not identical. 

[0011] The extension primer is a short DNA chain of 
approximately 10 to 20 nucleotides and functions as a 
recognition sequence for the site of a possible mutation. 
The extension primer is synthesized with a base se- 
quence so that it can be "hybridized" or "annealed" to 
the template strand, being an exact compliment to the 
base sequence in the vicinity of a known point mutation 
site. (The attachment of a complementary strand is 
known as "hybridization" or "annealing"). 
[001 2] Different types of primer extension procedures 
have been developed, generating either products with 
equal numbers of bases for mutants and wild types, dif- 
fering only by the differences in weight of the different 
bases (9 to 40 atomic mass units as differences), or 
products with different numbers of bases (at least about 
300 atomic mass units difference) for mutants and wild 
types. The latter are easier to measure by mass spec- 
trometry, but somewhat more complicated to generate. 
In both cases, however, the PCR products of the first 
amplification cycle have to be cleaned from the nucle- 
otide triphosphates and primers, new nucleotide tri- 
phosphates (including the terminating derivatives) and 
extension primers have to be added, and another set of 
copying thermocycles have to be applied. The final 
products, about 12 to 25 bases in length, again have to 
be thoroughly washed before mass spectrometric anal- 
ysis. Primer extension procedures are complicated, us- 
ing two differentthermocycling and washing procedures 
subsequently, thus about doubling the effort of a pure 
PCR amplification. 

[001 3] The primer extension methods are widely cov- 
ered by US 6,258,538 ((H.Koster et al.). 
[001 4] All primer extension methods have to use rath- 
er expensive types of polymerases because not all 
polymerases can handle the terminating dNTP deriva- 
tives. The use of thermosequenase, especially devel- 
oped forthe Sanger method of sequencing, is highly rec- 
ommended, more inexpensive polymerases do not cor- 
rectly bind the terminators. Inexpensive polymerases, 
such as tac polymerase, can only be used in the first 
amplification by PCR. 

[0015] Unfortunately, precise determination of the 
mass of even these relatively short primer extension ol- 
igonucleotides is still difficult. With a primer extension 
method delivering products with the same number of 
bases, the mass differences between wild type oligonu- 
cleotide and mutant oligonucleotide amount to 9 to 40 
atomic mass units only. Because of the poly-anionic 
character of the DNA, various numbers of ubiquitous so- 
dium (23 atomic mass units) or potassium ions (39 
atomic mass units) are particularly likely to attach to the 
oligonucleotides (instead of protons), and so-called "ad- 
ducts" are formed. The uncertainty in the degree to 
which the adducts are formed makes any precise mass 
determination exceptionally difficult - at the very least, it 
means that cleaning has to be extremely thorough to 
avoid the usually ubiquitous presence of any sodium or 



potassium cations and all relevant process parameters 
have to be carefully monitored for being kept constant. 
[001 6] Therefore, procedures have been searched for 
to shorten even more the relatively short primer exten- 

5 sion products, including partial enzymatic digestion and 
chemical or enzymatic cleaving. These shortening pro- 
cedures force to apply even more washing processes, 
even if the washing has not to be that thoroughful. 
[0017] One of the methods to shorten the products 

10 which have to be analyzed mass spectrometrically was 
proposed by Monforte et al. (J. A. Monforte, C. H. Beck- 
er T. A. Shaler, D.J. Pollart, WO 96/37630). The authors 
proposed the use of linkers which can be chemically or 
enzymatically cleaved. The necessary introduction of 

15 chemicals for the cleaving process, however, always 
has the disadvantage of introducing traces of impurities 
which again may form adducts. In addition, chemical 
cleavage needs adjustments of other parameters of the 
solution as for instance pH values, needing more chem- 

20 jcals to be added with the danger to introduce, e. g., al- 
kali ions. Enzymatic cleaving, e. g. by restriction endo- 
nucleases, means a very restricted design of the prim- 
ers which have to offer a recognition pattern forthe nu- 
cleases and also needs adjusted buffer conditions for 

25 cleavage, making washing after cleavage a necessary 
step. 

[0018] Another method of shortening the DNA prod- 
ucts by partial digestion has been developed by Gut and 
Beck (WO 96/27681), together with a neutralization of 
30 the DNA products, generating more peptide-like prod- 
ucts. 

[0019] The MALDI preparation and measurement 
procedure consists of first embedding on a sample sup- 
port the analyte molecules into small crystals of a solid 

35 UV-absorbent matrix, usually an organic acid. The sam- 
ple support is introduced into the evacuated ion source 
of the mass spectrometer. The matrix is then evaporated 
by a short laser pulse of about 3 nanoseconds, produc- 
ing a so-called plume consisting of a weakly ionized 

40 plasma which lasts for some tens of nanoseconds be- 
fore it quickly expands into the surroundung vacuum. 
The evaporation process transports also the analyte 
molecules into the plasma plume. The analyte mole- 
cules are ionized as a result of collisions with matrix ions 

45 of the plume but, unfortunately, a condition-dependent 
and length-dependent percentage of thefragile DNA an- 
alyte molecules will be fragmented. The voltage which 
is applied to the ion source apertures accelerates the 
ions into the flighttube which has no electrical field. Due 

50 to their differing masses, the ions are accelerated to dif- 
ferent velocities. The smaller ions reach the detector 
earlier than the larger ions. The flight times are meas- 
ured and converted into ion masses. 
[0020] MALDI is ideally suited for analyzing peptides 

55 and proteins. The analysis of nucleic acid chains is 
somewhat more difficult. Even in the case of short nu- 
cleic acid chains, ionization in the MALDI process is ap- 
proximately 1 00 times less successful than it is for pep- 
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tides; the sensitivity decreases superproportionally with 
increasing mass. The reason for this is that only a single 
proton has to be captured to ionize a peptide or a pro- 
tein. For nucleic acids with multiple negative charges on 
the poly-anionic sugar-phosphate backbone (one neg- 
ative charge for each nucleotide), the ionization process 
involving such a lot of protons is considerably less effi- 
cient. The DNA products which have to be detected 
must therefore be as short as possible so that they can 
be detected well. 

[0021] In a similar way, an ionization method can also 
be used which uses a liquid with solved samples as the 
starting point. This is known as electrospray ionization 
(ESI). There are different types of mass spectrometers 
equipped with ESI ion sources, such as ion traps, FTMS, 
and time-of-flight mass spectrometers with orthogonal 
ion injection. The method is also ideally suited to the 
detection of peptides and proteins but has similar prob- 
lems with oligonucleotides. Here also, the oligonucle- 
otides which are to be detected have to be as short as 
possible. 

Objective of the invention 

[0022] It is the objective of the invention to find an 
easy procedure which produces sufficient amounts of 
ultrashort and ultraclean DNA products for mass spec- 
trometric analysis; if any possible with only a single am- 
plifying and a single washing process, thus reducing 
time, cost, and effort of sample preparation, compared 
to hitherto used methods of primer extension. 

Brief description of the invention 

[0023] The invention is based upon a single applica- 
tion of a cyclic enzymatic amplification process such as 
the polymerase chain reaction (PCR), however using in 
this process a mixture of primers without and with built- 
in photocleavable "linkers" with specified properties. 
The linker-containing primers cause the generation of 
short by-products during the amplification process 
which cannot be amplified further. After amplification, 
the short by-products are extracted, e.g. by affinity 
bonding to substrates, washed, and cleaved by UV light 
to produce even shorter analytical products, ready for 
mass spectrometric analysis. The use of "blockers" with 
specified properties in one type of the primers allows for 
even shorter analytical products. 
[0024] Thus the procedure according to the invention 
consists of only one thermocycling and one washing 
process, followed by an easy, non-polluting cleavage 
procedure using a simple UV lamp delivering the final 
analytical DNA products for mass spectrometric analy- 
sis. 

[0025] The photocleavable linkers have the following 
properties: 

1 ) the linker can replace any nucleotide in a primer 



and maintains approximately the same distance be- 
tween the neighboring nucleotides as the replaced 
nucleotide; 

5 2) the linkerdoes not hinder proper annealing of the 

primer to a complementary counter strand, whereby 
the primer can anneal to a complementary counter 
strand with an arbitrary nucleotide opposite the link- 
er; 

10 

3) the linker does not hinder enzymatic elongation 
at the 3' end by the polymerase copying process if 
the linker is a few nucleotides away from the 3' end; 

15 4) the linker stops the polymerase copying proce- 
dure if encountered in a template; and 

5) the linker is cleavable by UV light, thereby cleav- 
ing the DNA sequence. 

20 

[0026] As photocleavable linkers with the above men- 
tioned additional features, building blocks from the o- 
nitrobenzyl derivatives class of compounds are particu- 
larly suitable. After converting the o-nitrobenzyl deriva- 

25 tives into DNA building blocks or analogues, these can 
be built into the primer at any position, replacing a reg- 
ular nucleotide. Such o-nitrobenzyl derivatives do not in- 
terfere with annealing and only slightly lower the opti- 
mum annealing temperature during a DNA polymerase 

30 reaction. They are accepted by various polymerases as 
non-interfering the elongation at the 3' end if they are 
positioned a small number of nucleotides away from the 
3' end. The synthesis and mechanism of photocleavable 
1-(2-nitrophenyl)ethyl esters of various different phos- 

35 phates and thiophosphates have already been exam- 
ined in detail by Walker et al. (J. Am. Chem. Soc. 1988. 
110, 7170-7177) and Ordoukhanian and Taylor (J. Am. 
Chem. Soc. 1995, 117, 9570-9571) but no application 
to mass spectrometry has been mentioned. It should be 

40 well understood that these linkers are by no means de- 
rivatives of nucleotides by just introducing other groups 
instead of the usual bases. The linker does not hinder 
the elongation of the primer at the 3' end by polymeras- 
es, whereby some polymerases require four nucleotides 

45 at the 3' end, others can start the copying process reli- 
ably with only three nucleotides between linker and 3' 
end. It is preferred to have the linker positioned as near 
to the 3' end as possible. 

[0027] The blockers, built-in alternatively in one type 
50 of analytical primers, are defined by the following prop- 
erties: 

1) the blocker can replace a nucleotide in a primer; 

55 2) the blocker does not hinder the annealing of the 
primer to a complementary counter strand; 

3) the blocker does not hinder enzymatic elongation 
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at the 3' end by the polymerase copying process 
even if the blocker holds the 3' position; and 

4) the blocker stops the polymerase copying proce- 
dure if encountered in a template. 

[0028] As blockers, many different nucleotide deriva- 
tives can be used. There may be one blocker each for 
each of the four types of nucleotides; but this is not nec- 
essary. One of the easiest derivative usable as a blocker 
is the nucleosidethiophosphate which anneals properly, 
can be elongated by the polymerase, and stops the cop- 
ying process if encountered in a template. It is favorable 
to use not just one nucleotide thiophosphate as a block- 
er, but two or three in a row to stop the polymerase cop- 
ying process of a template reliably. 
[0029] Other types of blockers are nucleotide deriva- 
tives where the base bonded to the sugar-phosphate 
backbone is replaced by a chemical group not correctly 
forming hydrogene bridges to the counter nucleotide, or 
not even forming hydrogen bridges at all. The blockers 
are preferredly positioned directly at the 3' end of the 
primer. In cases where the polymerase has difficulties 
to start elongation, it is possible to use a single regular 
nucleotide in the position at the 3' end, directly neigh- 
bored by the blocker nucleotide or nucleotides. 
[0030] PCR amplification is thusly performed with a 
mixture of two pairs of primers: a first pair of "selection" 
primers controlling the PCR process and a second pair 
of "analytical" primers, whereby one of the analytical 
primers of the pair contains a linker, and the other ana- 
lytical primer of the pair contains either a linker or a 
blocker. The two pairs of primers can be identical, ex- 
cept forthe linker or blocker site, but preferredly the link- 
er/blocker-containing analytical primer pair is "nested" 
in the PCR products generated by the pair of selection 
primers. The linker-containing analytical primers can be 
biotinylated at their 5' end for easy immobilization at a 
st reptavi din -coated surface and washing. Of course any 
other affinity capture group can be used instead of the 
biotin, or a part of the sequence itself may be used for 
immobilization by hybridization. 

[0031 ] Favorably, the primers contain the photocleav- 
able linker about two to five nucleotides away from the 
3' position. If the second primer of the pair contains a 
blocker, the blocker should be positioned at the 3' end, 
or at least in the position next to the 3' end.. 
[0032] The PCR amplification with the mixture of the 
two pairs of primers ends up with a high number of link- 
er-containing DN A by-products which are already short- 
ened beyond the mutation site because one of the ear- 
lier copying processes already had found a linker or 
blocker in the template to be copied (see figures 2 and 
3). If the linker-containing primers are biotinylated, then 
the final products can be immobilized at a surface cov- 
ered with streptavidin, washed, and cleaved. The whole 
process produces the expected short DNA products, in- 
termixed with products considerably longer because 



they still contain, at their 3' end, the complement of the 
full selection primer. These considerably longer DNA 
products may be washed away by size-specific adsorp- 
tion, but this is not really necessary because they regu- 

5 larly do not disturb the MALDI or ESI analysis. 

[0033] Using a pair of analytical primers, each of 
which contain linkers in the fifth position from the 3' po- 
sition, the length of the short product will add up from 
four bases of one analytical primer, from four bases 

10 complementary to the other analytical primer, and from 
the length of the sequence between the primers. With 
only the mutation site between the primers, the length 
of the short product will amount to exactly 9 bases. If the 
linker can be placed nearer to the 3' end, the product 

15 can even be shorter. Samples from both strands are pro- 
duced at the same time: the analytical result from one 
strand is corroborated by the analytical result of the oth- 
er strand. If only one linker-containing analytical primer 
is biotinylated, only one strand is analysed. 

20 [0034] Using a linker-containing primer and a blocker- 
containing primer as the analytical second pair of prim- 
ers, the final product for mass spectrometric analysis is 
even still shorter: It may contain four bases from the lin k- 
er-containing primer, plus the length of the strand be- 

25 tween the analytical primers. With only the mutation site 
between the primers and with the blocker in the 3' posi- 
tion, the total length is only five bases: a pentamer is 
produced. 

[0035] The PCR yield for the short products and the 

30 amount of longer chains in the final products depends 
very much on the ratio of linker/blocker-containing to 
linker/b locker-free primers in the mixture. If the primers 
of the analytical pair of primers both contain only linkers, 
and if the annealing process of all primers has the same 

35 probability, then the following relations hold true: The 
highest yield of wanted short DNA samples for analysis, 
obtained with the lowest number of thermo cycles, is 
achieved with a mixture of roughly 7% linker-containing 
analytical primers. A 1.5-fold larger amount of longer 

40 PCR products is intermixed, but these oligonucleotide 
will not be seen in the MALDI analysis. The PCR prod- 
ucts, generated by the selection primers, amount to an 
1 6-fold surplus. The surplus of PCR products can be di- 
minished by larger percentages of analytical primers, 

45 but the ratio of analytical primers to PCR selection prim- 
ers turns out to be not very critical. A compromise is a 
mixture of 1 0 to 20 % of linker-containing analytical prim- 
ers, but easily acceptable are ratios somewhere in the 
range from 3 to 30 percent. 

50 [0036] It is one the special advantages of this inven- 
tion that the photolytic cleavage does not introduce any 
additional pollutions as is the case with all chemical or 
enzymatic cleavage methods. 

[0037] Following the PCR process, the analytical by- 
55 products are immobilized, e.g. by streptavidin-coated 
surfaces if the products are biotinylated, for a thorough 
washing. After washing, the linkers of the still immo- 
bilzed products are cleaved with a UV lamp. The free 
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cleavage products now consist of the wanted short oli- 
gonucleotides of about five to eleven bases in length for 
an analytical pair of primers with only linkers, or with four 
to six bases in length for an analytical pair of primers 
with linkers and blockers, both intermixed with an slight- 
ly higher amount of products which contain the full prim- 
er length beyond the mutation site. 
[0038] In case of MALDI ionization, the immobilization 
can directly take place on the sample support plate if 
biotinylated primers with linkers are used and the sam- 
ple locations are coated with streptavidin. Such sample 
support plates can be coated with a highly hydrophobic 
coating, leaving only hundreds of small hydrophilic an- 
chors for sample preparation. The anchors are coated 
with streptavidin, and the PCR solution is simply pipet- 
ted from a well of the microtitre plate used for PCR to 
such a sample anchor. Due to the hydrophibicity of the 
plate surrounding the anchor, the samples of different 
wells keep separated on the plate. The final analytical 
biotinylated oligonucleotides are immobilzed on the an- 
chors, and the plate with hundreds of samples is thor- 
oughly washed. After cleaving and drying, the free 
cleavage products are taken up by a pipetted drop of 
solvent with matrix substance for the MALDI process. 
After a second drying process, the support plate is ready 
for MALDI analysis in a time-of-flight mass spectrome- 
ter. 

Brief description of the figures 

[0039] Figure 1 shows the structure of the preferred 
linker. (3-cyanoethylphosporamidite can be used to re- 
place, during primer synthesis, a complete nucleotide. 
The linker bridges the neighboring nucleotides with the 
same distance as a true nucleotide, but does not contain 
any sugar (ribose). Hybridization of the linker-containing 
primer to the complementary master template is possi- 
ble with any counter-nucleotide, whereby only a small 
decrease of the melting point is observed. R 1 and R 2 
are two DNA sequences. Cleavage produces the R 2 se- 
quence for mass spectrometric measurements, where- 
by the R 2 sequence is bound to a phosphate group with 
doubly negative anion character. After protonation of 
these two anions in the MALDI process, the phosphate 
group adds 80 atomic mass units to the weight of the 
protonated R 2 sequence. 

[0040] Figure 2 presents some initial, intermediate 
and final products of the PCR procedure when two prim- 
er pairs are used to perform the PCR, one pair (pre- 
ferredly about 80 to 90 %) without linkers and one pair 
with linkers. Oligomer (2a) is a part of the original DNA 
with nucleotides "N" containing the mutation site, des- 
ignated with "P". Oligomer (2b) represents the counter 
strand; here the complementary nucleotides are named 
"M" and the complementary nucleotide of the mutation 
site is termed "Q". (2c) and (2d) represent the first primer 
pair with nucleosides "1" and "2" (the 3' end is marked 
by "-"), producing in the PCR process ample amounts 



of single-stranded DNA segments (2e) and (2f). The 
second pair (2g) and (2i) of analytical primers, with nu- 
cleotides "3" and "4" respectively, carrying linkers "L" 
near the 3' position and affinity groups "A" at the respec- 

5 tive5" positions. These primers deliver the products (2h) 
and (2j) if annealed to the products (2e) and (2f) and 
complementarily copied many times by the polymerase. 
If in following thermocycles the primers (2g) and (2i) of 
the second pair now anneal to the linker-containing 

10 products (2h) and (2j) as templates, the relatively short 
products (2k) and (21) are produced because the copy- 
ing process stops at the position of the linker in the tem- 
plate. After sufficient thermocycles of the PCR process, 
the linker-containing products (2h), (2j), (2k), and (21) 

15 (plus some unused second primers) are extracted by af- 
finity bonding the affinity group "A" to a suitable sub- 
strate. Washing and cleaving under a UV lamp produces 
the final products (2m), (2n), (2o), and (2p) which are 
analyzed by MALDI. The cleaving process leaves be- 

20 hind a phosphate group (designated by "+"), adding 80 
atomic mass units after protonation. Since the longer 
products (2m) and (2o) cannot be seen because of their 
low sensitivity, only the mass signals of products (2n) 
and (2p) appear in the spectrum (possibly together with 

25 signals of the cleaved residual primers), showing the 
mass of the short products (2n) and (2p) (here 1 0 bases 
long), from which the nucleotide of the mutation site can 
be determined. Because strand and counter-strand of 
the original DNA are investigated at the same time, the 

30 determination of the mutation in the counter-strand 
forms a quality enhancement of the analysis procedure 
by automatic double-determination. If the primers (2g) 
and (2i) are not completely consumed durcing the PCR, 
these are extracted, too, and deliver some DNA quad- 

35 rumer ions. 

[0041] Figure 3 presents a similar PCR procedure, us- 
ing analytical primers with linkers "L" and analytical 
primers with blockers "B" as the analytical second pair 
of primers. Here, the final products consist of the longer 

40 products (31) and the shorter products (3m), whereby 
the products (3m) are only five bases long. These are 
called pentamers. 

[0042] Figure 4 exhibits three mass spectra of DNA 
pentamers, here produced from samples obtained by 

45 the extension of primers with linkers as shown in figure 
1 , and subsequent cleavage, therefore carrying one ad- 
ditional phosphate group (80 atomic mass units after 
protonation). The SNP is named PAN . The upper spec- 
trum shows the heterozygous case, the two lower spec- 

50 tra present the two homozygous cases. In all three spec- 
tra, the leftovers of the non-elongated extension primers 
are visible as DNAquadrumer ions; these may serve as 
easy mass references. 



[0043] A first favorable embodiment of the invention 
consists in a PCR amplification which is performed with 



55 Particularly favorable embodiments 
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a mixture of a pair of normal PCR selection primers with- 
out linkers (2c) and (2d) of figure 2 and a pair of bioti- 
nylated, linker-containing analytical primers (2g) and 
(2i). Preferredly, the selection primer pair amounts to 
about 90 percent and the analytical primer pair to about 
1 0 percent of all primer pairs. Both pairs of primers may 
have the same sequence, except for the linker; but in a 
preferred embodiment the linker-containing primer pair 
is "nested" inside the DNA product of the first selection 
primer pair, thus being annealed much nearerto the mu- 
tation site, as shown in Fig 2. Most favorably, the ana- 
lytical linker-containing primers are annealed directly 
next to the mutation site, as shown for primer (2i) in Fig. 
2. 

[0044] With the 90 percent selection primers (2c) and 
(2d) without linkers, the amplifying PCR procedure pro- 
duces DNA-sequences of both strands embracing the 
mutation site, increasing the number of normal PCR 
products (2e) and (2f) in each thermocycle by a factor 
of 1 .8. But beside this exponential PCR amplification by 
the linker-free primer pair (2c) and (2d), the 10 percent 
linker-containing primers (2g) and (2i) produce the se- 
quences (2h) and (2j) as linker-containing by-products 
which cannot be amplified further in full length. 
[0045] These by-products (2h) and (2j) can only be 
linearly amplified (up to the linker position) by selection 
linkers (2c) and (2d) respectively, getting products not 
shown in Fig. 2 for reasons of clarity. If it then happens 
that a linker-containing primer (2g) or (2j) anneals to 
these products or to the products (2h) or (2j) and is elon- 
gated by the polymerase, short products of the types 
(2k) or (21) are produced; the further amplification of 
these products (2k) and (21 ) is no longer possible. Thus 
the whole PCR procedure ends up with some amount 
of short biotinylated, linker-containing DNA by-products 
(2k) and (21) which are already shortened to four or five 
bases at the other end of the mutation site. 
[0046] The PCR amplification is usually performed in 
so-called thermocyclers, using microtitre plates with 384 
wells each. The thermocyclers are controlled to perform 
the thermocycles automatically. The thermocycle con- 
sist of three phases each: melting (separation of strand 
and counterstrand) by a temperature in the 90s (de- 
grees centigrade), annealing of the primers in the 50s, 
and polymerase complementary copying by prolonga- 
tion of the primer at its 3' end in the 70s. After PCR ther- 
mocycling with about 30 to 40 cycles is complete, the 
PCR solutions from the wells in the microtitre plates are 
pipetted as small droplets to special sample locations 
on a sample support plate for the mass spectrometer. 
Preferredly, these sample support plates have the size 
of microtitre plates, and contain 384 small hydrophilic 
sample anchors on an otherwise hydrophobic plate sur- 
face. The anchors have diameters of about one millim- 
eter, coated with streptavidin. The biotinylated products, 
consisting of the longer oligonucleotides (2h) and (2j), 
and the shorter oligonucleotides (2k) and (21), are im- 
mobilized by affinity bonds between the biotin and the 



streptavidin. 

[0047] The sample support plates are now thoroughly 
washed, and the linkers are cleaved by exposition to an 
UV lamp. The free cleavage products have the form of 

5 the oligonucleotides (2m), (2n), (2o), and (2p), with the 
addition of a phosphate group each stemming from the 
linkers. These free analyte products are taken up by a 
solvent drop containing the matrix substance for the 
MALDI process. Drying grows small matrix crystals with 

10 built-in analyte molecules, ready for MALDI analysis. 
During the MALDI process in the ion source of the mass 
spectrometer, the short products (2n) and (2p) which 
amount to roughly 40 percent of the free cleavage prod- 
ucts, are preferredly ionized and analyzed. 

15 [0048] The free products (2n) and (2p) consist of four 
bases each from the primer's 3' end (or the complement 
of it), and one to three bases including the mutation site. 
An optimum length for the short product is about nine to 
eleven bases. 

20 [0049] In the preferred embodiment with both linker- 
containing primers carrying the biotin group, both 
strands are analysed automatically for the mutation, in- 
creasing the accuracy of the analytical result by confir- 
mation. If only one linker-containing primer of the ana- 

25 lytical primer pair is biotinylated, only the mutation in one 
strand is analysed. 

[0050] The PCR yield and the amount of longer chains 
(2m) and (2o) in the final product mixture depends weak- 
ly on the ratio of linker-containing and linker-free primers 
30 in the mixture. If the probability for annealing is equal 
for all four primers, the highest yield of short DNA sam- 
ples for analysis, obtained with the lowest number of 
thermocycles, is achieved when a mixture with roughly 
7 % of linker-containing primers is used (exactly 6.9 %). 
35 This is the result of a mathematical simulation assuming 
equal hybridization rates for both primer pairs. But in this 
case, a 1.5-fold surplus of longer PCR products (2m) 
and (2o) is intermixed with the short chains (2n) and 
(2p). With higher percentages of linker-containing prim- 
40 ers (2g) and (2i) is used, the ratio of the short products 
(2n) and (2p) to the long products (2m) and (2o) can be 
somewhat reduced, but the total yield for these products 
is somewhat lower and requires more temperature cy- 
cles. With a higher percentage of linker-containing prim- 
es ers, the PCR yield is reduced. A compromise is a mix- 
ture of 1 0 to 20 % of linker-containing primers. The long 
products (2m) and (2o), respectively, cannot be seen in 
MALDI mass spectrometry because of their much lower 
sensitivity. In simulations, a production factor of about 
50 10 9 for the short products (2n) and (2p) is achieved in 
about 35 temperature cycles with these mixture ratios. 
[0051] The immobilization by biotin-streptavidin 
bonding can easily be replaced by other types of bond- 
ing well known to the specialists in the field. A special 
55 type of immobilization can be achieved by use of seize- 
specific adsorption, controlled by buffers. Magnetic 
beads and corresponding buffers for this purpose are 
on the market (e.g. Genopure™ from Bruker Saxonia 



7 



13 



EP 1 333 101 A1 



14 



Analytik GmbH, Leipzig). 

[0052] As indicated above, the linker-containing prim- 
ers (2g) and (2i) must not be identical with the linker- 
free primers (2c) and (2d) in the sequence of bases ex- 
cept the linker. They even can be considerably shorter. 
Linker-containing primers may be used which hybridize 
much nearer to the mutation site, on one or both sides. 
This allows forthe selection of non-interfering, non-fold- 
ing PCR selection primers for the exponential amplifica- 
tion process, and for a pair of nested, short primers con- 
taining the linkers and producing the final short prod- 
ucts. 

[0053] There are of course several variaties to this 
process. To reduce the number of thermocycles in order 
not to fade out the effectiveness of the polymerase 
which has a half life of only about 20 cycles, the well- 
known "touch-down PCR process" may be used: If the 
analytical primers are shorter than the selection primers 
and therefore show a lower optimum annealing temper- 
ature, the first PCR cycles may be performed with higher 
annealing temperature hindering the analytical primers 
to anneal. Normal PCR amplification rates with the se- 
lection primers are thus achieved. Only in later cycles, 
the annealing tempertures are lowered to hybridize the 
analytical primers with linkers (and with blockers, see 
next paragraph). The percentage of the analytical prim- 
ers may be chosen considerably higher with this touch- 
down process, and sufficient amounts of final by-prod- 
ucts can be generated with a lower number of cycles. 
The percentage of analytical primers can be in the range 
of 20 to 50 percent, but it should be considered that the 
analytical primers should be almost completely con- 
sumed in the PCR process because the non-used prim- 
ers are contained in the final products for mass spectro- 
metry measurements (see below). Commercially avail- 
able thermocyclers can be programmed to perform this 
touch-down process automatically. 
[0054] A second preferred embodiment with "block- 
ers" produces even shorter oligonucleotides for muta- 
tion analyses by mass spectrometry. The blockers are 
used in onetype (3g) of the analytical primer pair, where- 
as the other primer (3i) of the pair contains a linker. The 
original strands (3a) and (3b), some intermediate, and 
the final products (31) and (3m) of this procedure are 
presented in Figure 3. This procedure produces, after 
cleavage, extremely short oligonucleotides (3m) of only 
four to six bases in length, best suited for mass spec- 
trometric measurements. 

[0055] In detail, parts of the original DNA strands (3a) 
and (3b) are exponentially amplified by the selection 
primers (3c) and (3d) to the products (3e) and (3f). The 
blocker-containing analytical primer (3g) produces the 
by-product (3h) containing a blocker. If the linker-con- 
taining analytical primer (3i) anneals to the by-product 
(3h), the polymerase produces the linker-containing by- 
product (3k). This product can be immobilized because 
of its affinity group "A" to a suitable substrate, washed, 
and cleaved, whereby the free oligonucleotide (3m) is 



formed; the product (3m) consists of five bases only 
(plus a phosphate group from the linker) and contains 
the information which base was built into the mutation 
site of the original strand (3a). 

5 [0056] If the biotinylated primers (3i) are not completly 
consumed by the PCR process, these primers are also 
extracted, washed and cleaved together with the oli- 
gomers (3j) and (3k) , forming some tetramer ions. 
These tetramer ions have exactly known masses and 

10 may therefore serve as mass references for the mass 
determination process. Figure 4 shows this situation 
with linker-containing DNA products cleaved delivering 
pentamers, and leftover linker-containing primers 
cleaved delivering tetramers, which then serve as mass 

15 references. The measurements of figure 4 are based 
upon exactly the linkers shown in figure 1 . 
[0057] As blockers, derivatives of nucleotides can be 
used which do hinder the copying process of the 
polymerase if encountered in a template. The elonga- 
te tion process by the polymerase is controlled by the hy- 
drogen bonds between the nucleotide in the template 
and the building block to be built into the DNA chain be- 
ing elongated. It is well-known, that the hybridization or 
annealing process between strand and counterstrand 

25 forms either two or three well-defined hydrogen bridges 
between pairs of nucleotides. If now a nucleotide deriv- 
ative contains, instead of one of the four bases, a group 
forming greatly incorrect hydrogen bridges, or only one 
or even no hydrogen bond at all, it will form a blocker. 

30 The blocker-containing primer will still anneal, and the 
blocker can even be elongated if positioned atthe 3' end 
of the primer. But if encountered in the template, the 
blocker nucleotide derivative does not show the right hy- 
drogene bridge motive to find a counternucleotide; the 

35 copying process will be stopped. The biochemist in the 
field will be able, with the information given here, to find 
hundreds of different types of derivatives of nucleotides 
which can be used as blockers. 

[0058] Another type of blocker is obtained if the back- 
40 bone is derivatized to stop the copying process, e.g. by 
a phosphor thioate group instead of the normal phos- 
phate group. This group forms a somewhat weak block- 
er, and needs, for some types of polymerase, to be used 
in double oreven triple positions to reliably stopthecop- 
45 yjng process. 

[0059] The blocker-containing primers must not be 
applied in exactly the same amount as the linker-con- 
taining primers. It may be favorable to use much more 
blocker-containing primers than linker-containing prim- 
50 ers. 

[0060] The touch-down process with a PCR process 
starting at higher annealing temperatures first and then 
come down to annealing temperatures for the linker and 
blocker containing analytical primers can surely applied 
55 here, too. Also extraction processes with adsorptive 
magnetic beads (e.g. GenoPure™) instead of affinity 
groups bonded to the analytical linker-containing prim- 
ers, can be used here. 
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[0061] The process outlined here for the preparation 
of analytical oligonucleotides of short length for the in- 
vestigation of mutation sites by mass spectrometry, ex- 
poses several advantages. Firstly, only a single thermo- 
cycling phase has to be applied, saving time and effort. 
Secondly, the expensivethermosequenase as polymer- 
ase for the primer extension is no longer necessary, the 
inexpensive tac-polymerase may be used as usual for 
this amplification. Thirdly, the washing process by bond- 
ing the products to a substrate and subsequently cleav- 
ing is extremely simple. 

[0062] The PCR process of generating products suit- 
able for mass spectrometric mutation measurements 
can also be multiplexed to deliver more than one muta- 
tion-dependent product in the same process. The mul- 
tiplexing process for PCR is well-known to the specialist 
in the field. 



Claims 

1. Method for mass-spectrometric analysis of known 
mutation sites in DNA, comprising the following 
steps: 

(a) amplifying a DNA sequence containing the 
mutation site by polymerase chain reaction, us- 
ing a mixture of different types of primers 
wherein the primers of at least one type contain 
photocleavable linkers, 

(b) extracting linker-containing amplification 
products from the product solution, 

(c) cleaving the extracted linker-containing am- 
plification products, and 

(d) analyzing the cleavage products by mass 
spectrometry. 

2. Method as in Claim 1 , 

wherein the mixture of primers consists of a first and 
a second pair of primers, 

wherein the primers of the first pair do not contain 
any photocleavable linkers, and 

wherein the primers of the the second pair of 
primers contain photocleavable linkers. 

3. Method as in Claim 1 , 

wherein the mixture of primers consists of a first and 
a second pair of primers, 

wherein the primers of the first pair do not contain 
any photocleavable linkers or blockers, 
wherein the primers of one type of the the second 
pair of primers contain photocleavable linkers, and 
wherein the primers of the other type of the 
second pair of primers contain blockers. 

4. Method as in one of the claims 2 or 3 : wherein the 
linkers are located infirm positions 3 to 7 bases from 
the 3' position of the primers. 



5. Method as in one of the claims 1 to 4. wherein the 
linker stems from the class of chemical compounds 
known as o-nitrobenzyl derivatives. 

5 6. Method as in Claim 5, wherein the substance to syn- 
thesize the linker during primer synthesis is p-cya- 
noethylphosphoramidite. 

7. Method as in any of the preceding claims, wherein 
10 the primers of at least one of the linker-containing 

types of primers contains also an affinity group, and 
wherein affinity bonding to a substrate is used to 
extract, in step (b), the linker-containing primers 
carrying the affinity groups. 

15 

8. Method as in one of the claims 3 to 7. wherein the 
blocker is located in the 3' position or in the position 
next to the 3' position. 

20 9. Method as in Claims 8, wherein the blocker is a nu- 
cleoside thiophosphate. 

10. Method as in Claims 8, wherein the blocker is a nu- 
cleotide derivative not matching the hydrogen bond 

25 sites of any of the four bases. 

1 1 . Method as in one of the claims 2 to 1 0, wherein the 
ratio of the second pair of primers to the first pair of 
primers is within the range of 3 and 30 percent. 

30 

12. Method as in Claims 11 , wherein the ratio of the sec- 
ond pair of primers to the first pair of primers is with- 
in the range of 7 and 20 percent 

35 13. Method as in one of the preceding claims, wherein 
a multiplexing PCR is used with more than one an- 
alytical primer pair to products with information 
about more than one mutation site. 

40 14. Method as in one of the preceding claims, wherein 
the PCR amplification starts with higher annealing 
temperatures in the first thermocycles, and contin- 
ues later with lower annealing temperatures. 

45 15. Method as in Claim 14, wherein analytical primers 
in the range of 20 to 50 percent are used. 
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